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metric  space  by  generalized  rational  functions  for  which  the  denominators 
have  bounded  coefficients  and  are  bounded  below  by  a fixed  positive  function. 
This  lower  bound  alleviates  numerical  difficulties,  and  in  some  applications 
(e.g.,  digital  filter  design)  has  a useful  physical  interpretation.  A vzero 
in  the  convex  hull^  characterization  of  best  approximations  is  developed  and 
used  to  prove  uniqueness  and  de  la  Vallee Poussin  results.  Examples  are 
given  to  illustrate  this  theory  and  its  differences  with  the  standard  theory, 
where  the  denominators  are  merely  required  to  be  positive.  A modified 
differential  correction  algorithm  is  presented  and  is  proved  to  always 
converge  at  least  linearly,  and  often  quadratically. 


1.  Introduction 


In  this  paper  we  consider  approximation  of  continuous  functions  by 
generalized  rational  functions  whose  denominators  are  required  to  be 
bounded  away  from  zero.  This  is  in  contrast  to  the  standard  theory  where 
the  denominators  are  only  required  to  be  positive.  There  are  at  least 
four  reasons  for  having  this  stronger  requirement,  namely 

1)  best  approximations  always  exist; 

2)  iterative  procedures  for  computing  rational  approximations  may 
experience  numerical  difficulties  if  the  denominators  of  the 
rational  functions  become  very  small; 

3)  evei  if  a good  rational  approximation  is  found,  it  may  not  be 
very  useful  if  its  denominator  is  too  small  at  some  point; 

4)  there  may  be  some  physical  advantage  in  being  able  to  control 
the  denominator.  For  example,  McCall ig  [6]  has  used  a version 

of  the  algorithm  described  in  this  paper  to  compute  approximations 
to  the  desired  magnitude-squared  response  of  a digital  filter; 
control  of  the  denominator  of  the  rational  function  amounts  to 
control  of  the  feedback  gain  of  the  resulting  filter,  which  allows 
one  to  progress  smoothly  from  "fully  recursive"  filter  designs  to 
nonrecursive  designs,  and  to  reduce  sensitivity  problems  and  hardware 
requirements. 

Formally,  the  situation  we  are  considering  is  as  follows.  X is  a compact 
metric  space,  m and  n are  fixed  positive  integers,  P and  Q are  subspaces  of 

C[X]  with  bases  (e-j,  ....  em)  and  (ip-j ^n},  respectively,  and  L is  a 

strictly  positive  continuous  function  on  X (which  is  often  a constant  in 
practice).  Our  family  of  approximating  functions  is  then  defined  to  be 

\ = CP/Q:  P = P1e1  + ...  + pmem€:P,  Q = q1*1  + ...  + qn<|>ncQ, 


Q > L on  X,  |q^ | <.  1 for  j = 1 , ...»  n}. 
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We  note  in  passing  that  without  the  restrictions  |q.|  <_  1,  the  restric- 

J 

tion  Q >.  L would  be  no  stronger  than  the  more  usual  restriction  Q > 0, 
since  it  could  always  be  satisfied  by  multiplying  P and  Q by  a sufficiently 
large  positive  constant. 

In  order  that  RL  be  nonempty,  Q must  contain  at  least  one  strictly 

positive  function,  so  without  loss  of  generality  we  will  assume  > 0 on 

X.  To  insure  RL  ^ 0 and  for  other  reasons  which  will  be  clearer  later, 

we  will  also  assume  max  L(x)  < min  iMx);  this  requirement  is  no  restriction 

xeX  xeX 

in  practice,  since  it  can  always  be  obtained  by  multiplying  ill.  by  a suitable 
positive  constant. 

Given  f ec[X],  a best  approximation  to  f is  defined  to  be  a function 

R*€R^  such  that  ||f  - R*||  £ ||f  - R||  for  all  RgRl>  where  for  ge  C[X], 

||g||  = max | g(x) ] . The  following  theorem  can  be  proved  by  standard  techniques. 
xeX 

THEOREM  1 . If  f £ C[X],  then  there  exists  a best  approximation  to  f from 

V 

In  the  remaining  sections  we  will  consider  characterization  of  best 
approximations,  uniqueness,  de  La  Vallde  Poussin  results,  and  computation 
of  best  approximations  by  a modified  differential  correction  algorithm. 

The  theory  differs  in  some  respects  from  the  restricted  range  situation, 
where  the  entire  rational  function  rather  than  just  the  denominator  is 
restricted. 

Suppose  f £ C[X]  and  R*  £RL,  where  R*  = P*/Q*  * (P*0-|  + ...  + p*em)/ 

(q*</»l  + ...  + q*ipn) . The  following  notation  will  be  useful. 
o(x)  = f (x)  - R*(x)  vx  e X; 


P + R*0  = {P  + R*Q:  Pep,  Q £(?}; 
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s; 


XQ  = {x  e X:  | f (x ) - R*(x)|  = ||f  - R*||  }; 

Y0  = {y  ^ X:  Q*(y)  = L(y)}; 

I0  = {j  € {1,  ....  n>:  |qj|  = 1}; 

S = {a(x)x:  x € X0>  U (,£(y):  y <=  YQ}  U {qje^j:  J e I0>  where 
(®T (x) , ....  em(x) , R*(x)t|;1(x),  ....  R*(x)<J<n(x))T, 
jjj(y)  = (0,  •••»  0,  -♦<! (y),  •••»  “^p(y))  and  e^  = ( 6 1 » ...»  S^p  l^)» 
5^4  * Kronecker  delta; 

k 

tf'(S)  = the  convex  hull  of  S = { £ A.s.:  k is  a positive  integer, 

i=]  1 1 
k 

s,e  S Vi,  a.  >0  Vi,  l ^ = 1>; 

1 1 i=l  1 

intfc^S)  = the  interior  of  MS). 

2.  Characterization 

We  first  prove  a kolmogorov-type  characterization  theorem. 

THEOREM  2.  Suppose  f €.  C[X]  - R^.  Then  R*  = P*/Q*  £ R^  is  a best  approxi- 
mation to  f iff  there  is  no  P = p^e-j  +...+  pmem£P,  Q = q-j  ip-j  + . . •+qn<i'n  e.  <? 
satisfying 

(i)  sgn(P  + R*Q) (x)  = sgn(f(x)  - R*(x))  Vx  e XQ; 

(ii)  <3(y)  <0  vy  € YQ; 

(iil)  qj  > 0 if  qj-l; 

(iv)  qj  < 0 if  qt  = -1. 

Proof: 

( ♦):  Suppose  R*  is  not  a best  approximation.  Then  there  is  a better 

approximation  R = P/Qep^.  By  our  assumptions  in  the  previous  section, 

we  have  (max  L(x))/(min  i|i,(x))  < 1;  letting  a be  any  number  satisfying 
xeX  xeX 


(max  L(x))/(min  Mx))  < a < 1,  we  define  Q = q.i//,  + ...  + qnl/i  e Q by 
xex  xeX  1 11  n n 

Q = Q*  - a^.  Then  Q satisfies  Q(y)  < 0 Vy  G Yg,  Pj  > 0 if  q j = 1 , 

and  cfj  < 0 If  q*  = -1.  Now  define  P = P - P*.  Q = Q*  - Q + nQ,  where  n 

is  a positive  number.  For  x £ Xq,  we  have 

sgn(P  + R*Q) (x)  = sgn(P  - P*  + R*(Q*  - Q + nQ))(x)  = sgn(P  - R*Q  + nR*Q)(x) 

•\s 

= sgn{Q(x)[(R(x)  - R*(x)  + 

f\j 

s sgn[f(x)  - R*(x)  - (f(x)  - R(x))  + n-R-*^M]» 

Now  choosing  n so  small  that  I < II f - R*ll  - ||f  - R||  Vx  e XQ, 
we  have 

sgn(P  + R*0)(x)  = sgn(f(x)  - R*(x))  Vxe  XQ, 
so  (i)  holds.  For  y £ Yg,  we  have 

Q(y)  = Q*(y)  - Q(y)  + nQ(y)  < L(y)  - L(y)  + nQ(y)  < o, 
so  (ii)  holds.  If  q*  = 1,  we  have 

J 

<ij  * qj  - q j + nqfj  >.  1 - 1 + nqj  > 0,  so  (iii)  holds. 

If  q*  * -1 , we  have 

Qj  = qj  - qj  + nqfj  1-1  - (-1)  + nqj  < 0,  so  (iv)  holds. 

(*):  Suppose  there  exists  P6P,  QeQ  satisfying  (i)-(iv)  above.  Let 
X be  a small  positive  number.  Then 


P*  + xP  _ P*  a Pq*  + p*Q  = P + R*Q 
Q*  - XQ  Q*  (Q*-XQ)q*  Q*  - XQ  * 


P*  + p*  p X D*fl 

Thus  f - — = f - g * - x~ -5-,  and  using  the  arguments  of 

[2,  pp.  159-160]  it  can  be  shown  that  Ilf  - — ■*  -%\\  < Ilf  - £|| 

' Q*  - XQ  Q 

X sufficiently  small.  It  remains  only  to  show  that  P-*-  -+-A-  e R, 

Q*  - xq  L 


for  all 
for  X 


sufficiently  small. 
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Vy  6 Yq,  we  have 

Q*(y)  - xQ(y)  > Q*(y)  = L(y). 

Since  Y g is  compact,  there  exists  ? > 0 such  that  Q(y)  5 -c  Vy  €.  Yg.  Let 
Y|  * {x  6 X:  Q(x)  <~|},  Y2  = X - Yr  Then  Y2  is  compact,  with  Y2  n YQ  = 0. 

Let  v = min{Q*(x)  - L(x):  x eY2>  > 0.  Choose  X to  satisfy  0 < X < max(l!!  ||Q||  ) 

Then  if  x€  Y-j,  we  have  Q*(x)  - xQ(x)  > Q*(x)  ^L(x);  if  x€  Y2,  we  have 
Q*(x)  - X(5(x)  >_  L(x)  + vi  - xQ(x)  _>  L(x),  so  Q*(x)  - xQ(x)  >_  L(x)  Vx  e X. 
Finally,  if  q*  = 1 we  have 

q j - xqj  = 1 - Aqj  c ! , 
and  if  q*  = -1  we  have 

qj  - Aq.  = -1  - A^  > -1. 

so  choosing  X sufficiently  small  will  insure  that  |q*  - xcjj | <.  1 , j = 1 n. 

Thus  for  x sufficiently  small  we  have  - — --^3  eF.  and  ||f  - - — ^3|j  < ||f  - £*ll 

Q*  - XQ  L Q*  - XQ  Q* 


p* 

so  q*  is  not  a best  approximation. 


Q.E.D. 

We  can  now  prove  a "zero  in  the  convex  hull"  characterization  of  best 
approximations. 

THEOREM  3.  Suppose  f € C[X]  - Then  R*  = P*/Q*  <=FL  is  a best  approxi- 
mation to  f iff  the  origin  of  (m  + n)-space  lies  in  the  convex  hull  of  S 
(where  S is  the  set  defined  in  the  introduction). 


Proof:  By  the  theorem  on  linear  inequalities  [2,  p.  19],  0£  fl{ S) 

'V# 

iff  the  system  of  inequalities 


<z,  s>  > 0,  s 6 S 


6 


Is  consistent  (here  < , > denotes  Inner  product).  But  this  is  true  iff 
3 a vector  z = [z-j , ....  z^]1^  satisfying 

o(x)[z1e1(x)  + ...  + zmem(x)  + R*(x)(zm+1*1  (x)  + ...  + z^^fx))]  > 0 


Vx  e Xn; 


+ •••  + zm4-^n(y)  <0  W e Yn; 


nW-n'n' 


Vj  > 0 If  qj  - li  and 

Vi ' 0 1f  q3 = 

Letting  f = z,e,  + ...  + zm8m  and  Q • z^,*.,  + ...  + z^,  we  see  by 
Theorem  2 that  this  is  true  iff  R*  is  not  a best  approximation.  Thus 


0 € $4S)  iff  R*  is  a best  approximation. 
% 


Q.E.D. 


We  illustrate  the  application  of  this  theorem  with  the  following 
example. 

EXAMPLE  1.  Let  X = {0,  1},  f(x)  = x,  P = Jig  = the  set  of  all  polynomials 
of  degree  < 0,  Q = n?  L(x)  e .1,  R*(x)  = yy/(l  - .9x).  We  have  XQ  = {0,  1}, 
Y0  = (1).  I0  - til,  o(0)  = -1,  o(l)  = 1.  Thus 


0-j(x)  n 

a(x)  (R^Xx)  = a(x)  yy/(l-.9x)  , so  S 
(RA|)^(x)  yy/  (1  - . 9x) 

Solving  the  linear  system 


-x1  + x2  = 0 

" 17X1  + TTX2  " X3  + X4  = 0 

TTX2  “ X3  = 0 
+ X2  X3  X4  “ 1 


-1  1 

1 10 
11  ’ 11 


0 0 
-1  , 1 


yields  the  solution  x-|  = x2  - x^  = ^ x4  = Since  we  have 

X1  > 0 Vi , we  have  £€j$/(S),  so  R*  is  a best  approximation. 

We  may  observe  that  in  this  example  0 is  a positive  convex  combination 
of  exactly  m + n + 1 =4  vectors  in  S,  and  the  coefficient  matrix  used  is  nonsingular. 
Thus  ty  Cramer's  rule  (6-j  ,62,63)Te  y!/(S)  if  1 6-j  | , l^l*  |<S3I  are  sufficiently  smal  1 , so 
0 is  actually  in  the  interior  of  ^(S);  this  distinction  will  be  important  in  the 

*\j 

next  two  sections. 

The  next  example  illustrates  what  can  happen  if  our  assumption  that 

max  L(x)  < min  i|>,  (x)  is  violated. 
xeX  xeX 

EXAMPLE  2.  Let  X = [0,  1],  P » nQ,  Q = Ilj,  L(x)  = 1,  R*(x)  = l/(l+.5x). 

We  have 


' 0 
-1 

e s 

and 

‘o' 

1 

.0. 

0 

L J 

so  0£#(S)  regardless  of  whether  R*  is  a best  approximation  to  f or  not. 

Intuitively,  the  trouble  is  that  we  are  imposing  a double  restraint  on  the 

denominator  at  x = 0 which  ties  it  down  completely  there. 

The  next  example  shows  that  the  standard  alternation  characterization 

of  best  approximations  does  not  hold  in  our  setting. 

(3.5x,  0 < x £ 1 

2 + 1.5x,  1 < x < 2 , P = nn,  Q = iu, 

8 - 1.5x,  2 < x < 3 u 

L(x)  = .2,  R*(x)  * 1/(1  - .8x  + .2x2).  We  have  XQ  = (0,  1,  3},  YQ  = {2}, 

I0  * {1},  a(0)  = -1,  a(l)  = 1,  a(3)  = 1.  The  fact  that  R*  is  a best 

approximation  to  f is  shown  by  the  equality 


■ « 

-1 

1 

1 

0 

0 

0 

-1 

0 

+ TT 

2.5 

2.5 

2.5 

7.5 

+ 15 
34 

-1 

-2 

+ J- 

34 

1 

0 

= 

0 

0 

0 

2.5 

22.5 

-4 

0 

0 

4 

17 


r, 

[i 
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Without  denominator  restrictions  we  would  expect  (m-1 )+(n-l )+2-min(m-l -degree  of  P*, 

rvl -degree  of  Q*)  = four  alternating  points  for  f - R*;  since  there  is  only 

one  denominator  restriction  we  might  have  hoped  tor  three  alternating 

extreme  points,  but  there  are  only  two.  It  is  tempting  to  conjecture  that 

denominator  constraints  act  as  extreme  points  with  "sign  of  error" 

opposite  that  of  the  previous  extreme  point,  but  there  are  examples  which 

show  that  there  may  still  be  fewer  than  expected  alternating  extreme  points. 

Thus  there  does  not  appear  to  be  a simple  alternation  theorem  in  this 
setting.  Some  partial  results  under  suitable  Haar  assumptions  are  possible; 
however,  they  do  not  seem  to  add  much  insight. 


3.  Uniqueness  and  de  LaVallee  Poussin  Results 

Best  approximations  from  need  not  be  unique,  as  shown  by  the 
following  example. 


EXAMPLE 


_4.  Let  X = {0,  1},  f(x)  = x,  P = nQ,  Q = n2,  L(x)  s .1 


R*(x)  = Yp/(1  - .9x).  We  have  XQ  = {0,  1},  YQ  = {1},  IQ  = {!},  o(0)  - -1 , 
a(l)  = 1.  Thus 


a(x) 


Vx)  ' 
(fy)(x) 
(fyjHx) 
,(^3)(x). 


= a(x) 


1 

Tj/(l-.9x) 
yL/(l-.9x) 
yy/  ( 1 ~ . 9x ) 


, so  S = 


■-1  ■ 

'1  ' 

“o' 

"0“ 

1 

10 

-1 

1 

' 11 

11 

-i 

0 

0 

9 

10 

9 

9 

IT 

-1 

_0_ 

— 0 — l 

10 

Ln  J 

J 

and  R*  is  shown  to  be  a best  approximation  by  the  equality 


-1 

■ 1 ■ 

’ 0 ‘ 

'o' 

“o' 

1 

‘11 

4 

10 

11 

+ JO 

33 

-1 

1 

= 

0 

0 

10 

TT 

-1 

0 

0 

0 

m m 

10 

_ttJ 

r1. 

A 

0 

f 
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1 2 

But  R(x)  = iy/O  + P2X  + q3x  ) a^so  a best  approximation  for  any  q2»  q-j 
satisfying  |q2|  1 1.  h3l  < 1.  q2  + q3  = ",9  s1nce  llf  - Rll  = jy  = II f - R*||  • 
It  turns  out  uniqueness  is  assured  if,  unlike  the  situation  in  this 
example,  0 is  in  the  interior  of  #(S).  To  prove  this,  we  need  the  following 

*V» 

lemma. 


LEMMA  1 . Suppose  f <£  C[X]  and  R*eR^.  Suppose  Xq  is  an  arbitrary  compact 
subset  of  X,  and  S is  the  S of  earlier  theorems  with  Xg  replaced  by  Xg.  If 

0 feint  #(£),  then  P = 0,  Q = 0 is  the  only  solution  in  P,  (?  to  the  inequalities 

(a)  a(x)(P  + R*Q)(x)  _>  0 Vx  e XQ; 

(b)  Q(y)  <0  Vye  Yq; 

(c)  q..  > 0 if  qj  = 1; 

(d)  q.  < 0 if  qj  = -1. 

Proof:  Suppose  PfeP,  0 & 9 satisfy  (a)-(d).  Letting  z = [pn»...pm»q1»...fl„]T, 

'b  i m i n 

the  system  (a)-(d)  may  be  rewritten  as  <z,  s>  > 0 VseS.  Suppose  that 

'V.  'Vi  'Xi 

there  is  a z t 0 satisfying  these  inequalities.  Since  0 feint  #(S)»  for 
6 > 0 sufficiently  small  we  have  -Sz  €#(§).  By  Caratheodory’s  theorem 

'V 

[2,  p.  17]  for  some  integer  k<m  + n + 1 3 s, s,6S, 

k k ^ ™ 

An,  ...»  A.  > 0 with  7 X.  = 1 such  that  -6z  = t X.s . . Thus 

1 * j = l 1 ^ j=1 


-6<z,  z>  = <z,  -$z>  = <z. 


I X.-s .>  = T x.<z,  s.>  > 0,  which  is  a contradicti< 

• I T/ 1 i"-!  ^ 


Thus  z = 0,  so  P = 0,  Q = 0. 


Q.E.D. 


Although  we  will  not  use  it  in  this  paper,  the  converse  of  this  lemma 
can  also  be  shown  to  be  true. 

We  can  now  prove  uniqueness  if  0 fe  int<#(S). 
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THEOREM  4.  Suppose  f e C[X]  - RL  and  R*  = P*/Q*  e R^.  If  0 e int<£(S),  then 
R*  is  the  unique  best  approximation  to  f fromRL. 

Proof:  By  Theorem  3,  R*  is  a best  approximation.  Suppose  R=P/Q  e R^  were 
another  best  approximation.  Let  P = P-P*,  f)  = Q*-Q.  For  x£Xg,  we  have 
o(x)(P  + R*Q) (x)  = o(x)(P  - R*Q) (x)  = a(x)Q(x)[f (x)  R*(x)-(f (x)-R(x))]>0. 

For  y ^ Yq,  we  have 

Q(y)  = Q*(y)  - Q(y)  = L(y)  - Q(y)  < o. 

Finally,  if  q*  = 1,  we  have 

Qj  = qj  - qj  ■ 1 - qj  > 0 

and  If  qf  * -1,  we  have 

V 1}  - 1j  ■ - 1j  £ °- 

Thus  by  Lenina  1,  P = 0 and  Q = 0.  Thus  P = P*  and  Q = Q*,  so  R*  is  unique. 

Q.E.D. 

The  next  example  shows  that  the  converse  of  this  theorem  if  false. 

EXAMPLE  5.  Let  X = [0,  1],  f(x)  = 2 - 2x,  P = Q = Hg,  L(x)  = .1,  R*(x)  = |. 

We  have  X0  = {0,  1},  YQ  = 0,  IQ  = (1),  S = j[]],  [’]],  [j  |.  Here  R*  is  the 
unique  best  approximation,  but  0 £ int#tS). 

In  general,  we  always  have  0 £ int  */(S)  if  R*  is  a best  approximation 

'Vi 

with  Yg  = 0;  the  reason  is  that  0 €rint^(S)  implies  that  the  coefficients 
of  the  best  approximation  are  unique,  but  Yg  = 0 implies  that  for  some  a with 
0 < a < 1,  (aP*)/(aQ*)  is  another  best  approximation  in  R|_  with  different 
coefficients. 

The  hypotheses  of  Theorem  4 are  actually  sufficient  to  prove  strong 
uniqueness. 


THEOREM  5.  Suppose  R*  is  a best  approximation  to  f €rC[X]  with  0 <=  int  V(S). 
Then  there  is  a constant  y > 0 such  that  for  any  R£  R,  , ||f  - R||  >_||f  - R* \\ 
+ y||R  - R* ||  . 


11 


Proof:  (Sketch)  The  proof  follows  the  same  general  lines  as  the  proof 
of  strong  uniqueness  in  the  setting  where  the  denominator  is  merely  required 
to  be  positive  ([2,  p.  165]),  with  P - P*  playing  the  role  of  P in  the 
standard  proof  and  Q*  - Q playing  the  role  of  Q,  and  Lemma  1 of  this 
paper  used  in  place  of  Lemmas  1 and  2 in  [2],  The  other  major  change  is  that 
the  definition  of  c on  p.  lo"  of  [2]  is  replaced  by 

c = inf{max[-a(x)(P+R*Q)(x)]:  p e P,  Q <£•  Q,  0(y)  £ 0 \/y  <=  YQ, 
q . ^ 0 if  q*  = 1 , q,-  1 0 if  q*  = -1 , 
max{  ||P  + R*0||  , ||P ||,  ||Q||}  = 1}; 

the  extra  complication  in  the  last  equality  of  this  definition  is  needed 
since  otherwise  we  could  have  ||P  + R*Q||  = 1 with  ||Q||  and  ||P[|  arbitrarily 
large. 

Q.E.D. 

We  finish  this  section  with  two  de  La  Vallee  Poussin  estimates,  which 
give  lower  bounds  on  error  norms. 

THEOREM  6.  Suppose  f €:  C[X]  - R,  , R*€P,  (not  necessarily  a best  approxi- 

* 

mation),  and  Xn^0  is  some  compact  subset  of  X.  Suppose  OG.'V(S),  where  S is 

U 'V, 

the  S of  earlier  theorems  with  Xq  replaced  by  Xq.  Then  inf{  ||f  - R||  : RG  R|_ } 
>.min{|f(x)  - R*(x)|:  xg  Xq}. 

Proof:  Without  loss  of  generality,  we  may  assume  f(x)  f R*(x)  Vx  e Xq. 
Suppose  the  conclusion  of  the  theorem  is  false.  Then  3R  = P/Q  e F^  with 
|| f - R ||  < mint |f (x)  - R*(x)|:  x GXq}.  Let  P h P - P*.  Q = Q*  - Q. 
Proceeding  as  in  the  proof  of  Theorem  4,  we  get 
(a’)  o(x)(P  + R*Q)(x)  >0  VxG  XQ; 

(b)  Q(y)  <0  vy  €r YQ; 

(c)  Qj  > 0 if  qj  w 1; 


mm — ■ 
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(d)  Qj  1 0 if  qj  = -1. 

Since  OG*/(s),  Caratheodory ' s theorem  ([2,  p.  17])  implies  that  for  some 

-v  k 

integer  k < m + n + 1 , 3 ^ , ....  sk6S,  x] , ....  xk  >,  0 with  J x^  = 1 

^ £*1 
k 

such  that  l as  = 0.  Thus  there  are  nonnegative  indices  u,  v,  w with 


£=1 


W 


“ + v + w ’ k such  that  j,  + J,V4(yt> + ,1, ■ S 

where  , . . . » xy  £ Xg’»  y-| » • • • » yv  €L  Yg',  i^ , . . . , iw  Ig  and  = (0^  (x^) , 
•••»  em(x£)»  r*(xji)^i  (x£) ^*(x£^n(X£^’  We  will  next  show  that 


I VW(p  + R*Q)(XJ  i o. 


£=1  * 1 


We  have 


M 

1 X£°(x£)0i(x£)  ■ 0 for  1 - 1 m. 


£=1 


So  we  get 
u 


j JW»qJ  ifj  = ,»eIo 
lo 


otherwise, 
m n 


l A£CT(xJl)R*(xJi;)'<'j(x£)  = I xu+^j(y£)  - Xj  for  j = 1,  . ...  n,  where 
1 £ 1 

i 

+ R'QHx,)  * + R*(xJ!)^qj»j(x!()] 

* 1^t[J1X«o(x»>9i(xt)1  + 

‘ 0 * j=i5JcJ/u+e*j(y,t) ' sj]  ’ j, WjlvA)]  - hft 

‘ j,VM>  - j, 


by  properties  (b)-(d),  as  claimed.  But  by  property  (a1),  this  implies 

Ai  = ...  = A = 0.  But  this  in  turn  implies  that  0£#(S'),  where 
1 u x 

S'  * {£(y):  y £Yg}  Utqje^:  j € IQ}.  Thus,  If  XQ  = (x  <=  Xi|f(x)-R*(x)| 
* llf  ’ R*ll  > as  before,  we  have  0e  ^({c(x)x:  x e Xn>  US').  Thus,  by 

'V/  *\,  U 


kAmai  me'MvdibmmiaAMIm  Mil  i iW 


- - - ■ 
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Theorem  3,  R*  is  a best  approximation  to  f.  Thus  inf{  ||f  - R||  : R eFj_} 

« || f - R* ||  * max|f(x)  - R*(x)|  ^min{|f(x)  - R*(x)|:  x£.Xq},  contradicting 
xeX 

the  assumption  that  the  conclusion  of  the  theorem  is  false. 

Q.E.D. 

If  we  assume  that  0 e int  #(S),  we  can  prove  the  following  stronger 
result. 

THEOREM  7.  Suppose  f € C[X],  R*  = P*/Q*  e F^  (not  necessarily  a best 

f\,  ^ 

approximation),  and  Xq  t 0 is  some  compact  subset  of  X.  Suppose  0 € int  MS), 

- *\j 

where  5 is  the  S of  earlier  theorems  with  Xq  replaced  by  Xq.  Then  for  every 
R = P/Q  e Fl  with  R £ R*  we  have  max{  |f  (x)  - R(x)  | : xG  Xq} 

> m1n{|f(x)  - R*(x)|:  x e XQ}. 

Proof:  Suppose  the  conclusion  is  false.  Let  P = P - P*,  Q = Q*  - Q. 

Then  3 R €.  FL  with  R t R*  and  max{ |f (x)  - R(x) | : x e XQ> 

% 

_<  min{|f(x)  - R*(x)|:  xe  Xq}.  Proceeding  as  in  the  proof  of  Theorem  4, 
we  get 

(a")  0(x)(P  + R*Q)(x)  >0  Vx  € XQ; 

(b)  Q(y)  <0  vy  e Y0; 

(c)  qj  > 0 if  qj  ■ 1; 

(d)  < 0 if  qj  = -1. 

Thus  Lemma  1 implies  P = 0,  Q = 0.  Thus  R = R*.  contrary  to  assumption. 

Q.E.D. 

We  observe  that  Example  4 with  X replaced  by  {0,  .1,  1}  and  Xq  = {0,  1} 
shows  that  the  conclusion  of  the  theorem  may  fail  if  0 £ int  #(S). 


— «-rr 
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4.  Computation  of  Best  Approximations 


The  differential  correction  algorithm  introduced  by  Cheney  and  Loeb 


[3]  and  discussed  further  by  Barrodale,  Powell,  and  Roberts  [1]  can  be 


modified  to  compute  approximations  from  F|_  by  inserting  extra  constraints 
to  force  Q(x)  ^.L(x).  We  have 


ALGORITHM  (Restricted-denominator  differential  correction  - RDDC). 

(i)  Choose  Pq/Q0<^  *V> 

(ii)  Having  found  Pk/Qk<s  FL  with  ||f  - Rk||  = Ak,  choose  Pk+1 , Qk+1 
as  a solution  to  the  problem 


|f (x)Q(x)  - P(x)|  - AkQ(x) 
minimize:  max r..\ 


subject  to:  Idj  I 1 1 » j = 1 * • • • » n,  and  Q(x)  >_  L(x)  \/x  ^ X; 

(iii)  Continue  until  some  stopping  criterion  is  met. 


One  common  stopping  criterion  is  to  stop  when  (Ak  - Ak+^)/\  < e 
for  some  prescribed  e > 0,  selecting  Rk+^  as  the  approximation  returned  by 
the  algorithm  if  Ak+^  < Ak,  and  selecting  Rk  otherwise.  A convenient  way 
of  choosing  Pq/Qq,  w^1c^  often  is  considerably  more  efficient  than  making 
some  arbitrary  choice  such  as  Pq/Qq  s ^ (see  Lee  anCl  ^ot:)erts  [5]  for 
numerical  evicence  in  the  unrestricted-denominator  case),  is  to  minimize 


max|f(x)Q(x)  - P(x)|  subject  to  Q(x)  >.  L(x)  Vx  € X and  |q-|  <.1  for 
xeX  J 


j — 1,  ...,  n. 


Using  the  techniques  of  Barrodale,  Powell,  and  Roberts  [1],  we  prove 


THEOREM  8.  The  RDDC  algorithm  converges  monotonically  and  at  least  linearly. 


Proof:  Let  M = max  £ |i^(x)  | (thus  ||Q||  < M for  all  Q e Q with  |q A < 1 ). 

xeX  j=l  3 3 ~ 

Suppose  Rk  is  not  a best  approximation.  Let  R*  be  a best  approximation  with 


A*  * ||f  - R* ||  . Let  6 = min  L(x)  > 0.  We  have 

xeX 


IS 


|f(x)Qk+1(x)  - Pk+1(x)|  - AkQk+1(x)  |f(x)Q*(x)  - P*(x)|  - AkQ*(x) 

3 S ipsj 


max{[|f(x)  - R* ( x ) | - Ak]-^|||-}  < [a*  - Ak]-|  < 0 


xeX 
VX  e x. 


Pk+1 M 


■ pk+i<x), 

V]<eX-  |f(x)  - 4k  ‘ 0 

Ak+i  < Ak,  so  the  convergence  is  monotonic. 
Vx  e X,  we  have 


Pk+1 W 


|f(x) " Ciw1 ' Ak  1 [A*  " Ak]T0l7^y-  [A*  ’ AkH^i} 


Ql(x) 


r <$  6 1 


<k+l 


k+1 


Vr  Ak  - [A*  - 


Vl  “ A*  " W ' -“^(Ak  " A*> 

Ak+^  - A*  £ (1  - jJJ7)(Ak  - A*) 

Ak  converges  at  least  linearly  to  a*. 


Q.E.D. 

We  observe  that  this  theorem  is  stronger  than  the  corresponding 
theorem  in  the  unrestricted-denominator  case  [1,  Theorems  1 and  2]  in 
that  it  gives  information  on  the  rate  of  convergence,  and  finiteness  of 
X is  not  required  to  prove  convergence.  Finiteness  of  X is  required, 
however,  in  order  to  run  the  algorithm  in  the  usual  way. 

The  following  lemma  was  proved  and  used  by  Barrodale,  Powell,  and 
Roberts  [1]  for  the  case  P = nn_^  and  Q = nn_^.  The  Haar  subspace 
assumption  of  our  lemma  is  equivalent  to  their  assumption  that 
m1n(m  - 1 - degree  of  P*,  n - 1 - degree  of  Q*)  * 0,  with  P*  and  Q* 
having  no  common  nonconstant  factors  in  their  setting. 
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LEMMA  2.  Suppose  X contains  at  least  m + n + 1 distinct  points  and 

R*  = P*/Q*  € Rq  = {P/Q:  P = p101  + ...  + pm0m€  P,  Q = + ...  + q^  € Q, 

Q > 0 on  X,  max  IqJ  = 1}.  Suppose  that  the  space  spanned  by 
l<j<n  J 

{6l Qm»  R*^i » •••»  R*i|>n>  Is  a Haar  subspace  of  dimension  m + n - 1; 

that  is,  the  space  has  dimension  m + n - 1,  and  no  nontrivial  element  of 
it  has  more  than  m + n - 2 distinct  zeros  in  X.  Then  He  > 0 such  that  for 

all  R = P/Q  €R0  we  have  ||Q  - Q*||  <Q  ||R  - R*||  . 

Proof:  Dua  and  Loeb  [4]  prove  this  lemma  in  the  case  where  X = [0,  1], 

P * nm-i»  anc*  *3  = nn-l*  kut  tfieir  Proof  requires  these  extra  conditions 
only  in  proving  that  if  Q >.  0 on  X and  P = R*Q  on  X,  then  P = P*  on  X and 
Q = Q*  on  X.  This  fact,  however,  follows  from  an  argument  of  the  type 
given  by  Cheney  [2,  p.  165]. 

Q.E.D. 

We  can  now  prove  quadratic  convergence  of  the  RDDC  in  some  circumstances 

THEOREM  9.  Suppose  X contains  at  least  m + n + 1 distinct  points, 

R*  = P*/Q*  € R^  is  a best  approximation  to  f €•  C[X]  - R^,  and  the  space 

spanned  by  (Q-j , ...»  0m,  R*^ R*<|>n}  a ^aar  su^sPace  of  dimension 

m + n - 1.  Suppose  that  either  of  the  following  two  conditions  holds: 

(A)  Yq  = 0 
or 

(B)  0 eint#(S). 

Then  the  rate  of  convergence  of  the  restricted-denominator  differential 
correction  algorithm  is  at  least  quadratic. 

Proof:  If  some  R^  is  a best  approximation  the  conclusion  of  the  theorem 
is  true,  so  we  assume  this  is  not  the  case. 
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Then  the  approximations  produced  by  the  RDDC  algorithm  (except 

possibly  Pn/Qn)  satisfy  the  normalization  max  |q.-|  = 1,  since  otherwise 

lijiP  3 

the  (negative)  minimum  computed  in  step  (i i ) of  the  algorithm  could  be 
decreased  by  renormalization.  Since  we  may  also  assume  R*  satisfies 
this  normalization,  the  hypotheses  of  Lemma  2 hold  for  R*  and 
R^,  k >.  1 . We  now  claim  that  if  condition  (A)  holds,  then  R* 
Is  a best  approximation  to  f from  Fq.  To  see  this,  we  note 
that  by  Theorem  3 we  have  0 <?-7>/({o(x)x:  x e-  XQ1  U J e Iq>) 

(see  introduction).  If  there  were  R = P/Q  £ Fq  satisfying  ||f  - R|| 

< || f - R* ||  , letting  L(x)  = lmin(min  L(x),  min  Q(x))  we  have  R*  e 

* xeX  xeX  L 

and  R € F£,  with  Q*  > L and  Q > L.  But  the  convex  hull  statement  above 

and  Theorem  3 now  imply  that  R*  is  a best  approximation  from  R£,  which 

is  a contradiction.  Thus,  if  condition  (A)  holds,  the  strong  uniqueness 

of  R*  holds  by  a theorem  in  Cheney  [2,  p.  165],  while  if  condition  (B) 

holds,  strong  uniqueness  holds  by  Theorem  5.  With  the  strong  uniqueness 

of  R*  and  the  conclusion  of  Lemma  2 available,  the  rest  of  the  proof  is  as 

given  by  Barrodale,  Powell,  and  Roberts  [1,  Theorem  3]. 

Q.E.D. 

As  noted  earlier,  conditions  (A)  and  (B)  of  this  theorem  are  mutually 
exclusive;  Example  4 shows  that  they  are  not  exhaustive.  Under  the 
hypotheses  of  Theorem  9 the  absolute  values  of  the  differences  of  the 
coefficients  of  R^  and  R*  can  be  shown  to  be  bounded  by  sequences  which 
converge  quadratically  to  zero. 

It  is  sometimes  desirable  to  ignore  the  function  f at  some  points  of 
X,  but  still  apply  the  denominator  restrictions  on  all  of  X;  the  theory  of 
this  paper  goes  through  unchanged  if  the  subset  of  X on  which  f is  to  be 
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approximated  is  compact.  This  situation  is  illustrated  in  the  following 
example,  where  f is  the  desired  magnitude  squared  response  of  a digital 
filter,  and  we  do  not  wish  to  approximate  f in  the  "transition  band" 

(.1,  .11). 

EXAMPLE  6.  Let  X * {0,  .005,  .01,  .015 5}, 

fl » 0 £ x <_  .1  g 

f(x)  *<  undefined,  .1  < x < .11,  P * Q * { £ a^cos(2ir(i  - 1 )x ) } , 

1.14,  .11  < x < .5  1=1 

L(x)  = .125.  Applying  the  RDDC  algorithm  on  a CDC  CYBER  172,  which  has 
roughly  14  digits  of  accuracy,  we  get  (rounded  to  5 places) 

R*(x)  s (.34408+. 351 85  cos  2ttx  + .08270  cos  4ttx  + .15552  cos  6irx 

+ .31629  cos  8nx  + .23541  cos  10irx)/(l  .00000  + .27473  cos  2nx 

- .56073  cos  4irx  - .04879  cos  6irx  + .68555  cos  8irx  + .37588  cos  IOitx), 

with  A*  = .13945,  XQ  = {0+,  .085",  .1+,  .IT,  .125+,  .215',  .34+,  .345+, 

.405',  .41",  .5+}  (where  the  sign  indicates  the  sign  of  f - R*),  YQ  = {.105}, 
I0  = {1}.  Nine  iterations  were  required,  with  the  quantities  Ak  - A*  (for 
k * 0,  1,  ....  8)  being  approximately  1 x 10'1,  8 x 10"2,  2 x 10'2,  1 x ", 

4 x 10~4,  1 x 10'6,  8 x 10'12,  2 x 10'14,  8 x 10'15;  this  sequence  indicates 
the  eventual  quadratic  nature  of  the  convergence,  up  to  machine  accuracy. 

We  finally  remark  that  the  theory  of  this  paper  can  also  be  extended 
to  include  restricted  range  conditions  and  a positive  continuous  multipli- 
cative weight  function  w;  that  Is,  we  may  further  require  the  functions  R 
In  to  satisfy  R < u on  X|  and  R > t on  Xg,  where  u and  i are  given 
continuous  functions  defined  on  compact  subsets  X^  and  of  X,  respectively, 
and  we  wish  to  minimize  ||w*(f  - R)||  Instead  of  ||f  - R||  . The  set  S must 
be  expanded  to  include  the  vectors  of  the  form  a(x)x  (xe  Zq),  where 
Zq  ■ (x  € X:  R*(x)  = u(x)  or  R*(x)  * z(x)}  and 
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„ (-1  If  R*(x)  - u(x) 

a(x)  = \ 

[l  if  R*(x)  = A(x). 

In  this  setting,  the  assumption  max  L(x)  < min  <h(x)  must  be  replaced 

X€X  xeX 

by  the  assumption  that  there  exists  a member  of  RL  satisfying  all  the 
restrictions  (denominator  and  restricted  range)  strictly.  The  restric- 
tions P <_  Q*u  on  X-|  and  P _>  Q*t  on  X-|  must  be  added  to  the  RDDC  algorithm, 
and  in  the  expression  to  be  minimized  in  steps  (i)  and  (ii)  P(x)  must  be 
replaced  by  w(x)P(x).  The  results  of  this  paper  still  hold  essentially 
as  stated  (with  a few  minor  changes  in  the  proofs)  except  for  Theorems  2 
and  9 and  Lemma  1;  in  Theorem  2 we  must  add  the  condition  sgn(P+R*Q) (x)=a(x) 
Vx  e Zgj  in  Lemma  1 we  must  add  the  condition  a(x)(P  + R*Q)(x)  ^ 0 Vx  e ZQ; 
and  in  Theorem  9 we  must  add  to  condition  (A)  either  Zq  = 0,  or  f _<  u on 
X-j  and  f > n on  X2.  For  numerical  examples  in  this  extended  setting,  see 
McCallig  [6]. 
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20.  Abstract  (cont) 

' denominators  are  merely  required  to  be  positive.  A modified  differential 
correction  algorithm  is  presented  and  is  proved  to  always  converge  at  least 
linearly,  and  often  quadrat ically. 
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